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ENHANCED WIDE DYNAMIC RANGE IN IMAGING 

FIELD OF THE INVENTION 

[0001] The present invention relates to image enhancement. More particularly it relates 
to enhancing wide dynamic range in imaging. 

BACKGROUND OF THE INVENTION 

[0002] When a photograph is taken with an object of particular illumination with 
background of significantly higher illumination most imaging apparatuses fail to record 
the composite image in same detail accuracy. Either the background appears in great 
details and the object in front of the background appears poorly lit, or the object is 
shown in great details and the background appears over-exposed. 

[0003] In order to address this problem the photographic sensor has to be able to exhibit 
a very wide dynamic range, alas photographic sensors in general are limited in the 
dynamic range they are sensitive to. 

[0004] It is a purpose of the present invention to introduce a novel method and 
apparatus for enhancing the dynamic range in images, employing a virtual binary 
illumination mask. 

SUMMARY OF THE INVENTION 

[0005] There is thus provided, in accordance with a preferred embodiment of the 
present invention, a method for enhancing wide dynamic range in images, the method 
comprising: 

[0006] acquiring at least two images of a scene to be imaged, the images acquired using 
different exposure times; 

[0007] constructing for a first image of said at least two images an illumination mask 
comprising a set of weight values distinctively identifying respective areas of pixels of 
high or low illumination, over-exposed or underexposed with respect to a predetermined 
threshold illumination value, assigning one of the weight values to each pixels, whereas 
other weight value is assigned to other pixels of the other of said at least two images; 
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[0008] using a low-pass filter to smooth border zones between pixels of one weight 
value and pixels of other weight value, thus assigning pixels in the border zones new 
weight values in a range between the weight values; 

[0009] constructing a combined image using image data of pixels assigned with one 
weight value of the first image and image data of pixels assigned with other weight 
value of the other of said at least two images and in pixels corresponding to the border 
zones using image data from said at least two images proportional to the new weight 
values. 

[0010] Furthermore, in accordance with a preferred embodiment of the present 
invention, the weight values are binary values. 

[0011] Furthermore, in accordance with a preferred embodiment of the present 
invention, the acquired images are in JPEG format, the JPEG format including a DCT 
transform domain. 

[0012] Furthermore, in accordance with a preferred embodiment of the present 
invention, the step of constructing the combined image is carried out in the DCT 
transform domain. 

[0013] Furthermore, in accordance with a preferred embodiment of the present 
invention, the following relation is used in the constructing the combined image: 
[00 i4] = o(/ £. ) * /^w + (1 _ a(/ £. )} * joct^ . RatiQ ^ 

[0015] where /£*is the DC coefficient of the DCT transform of the relatively longer 
exposure image, a is a weight representing the illumination mask and Ratio is a measure 
that defines the relation between the images of different exposure exposures, and p, q 
are DCT coefficients and * represents convolution. 

[0016] Furthermore, in accordance with a preferred embodiment of the present 
invention, only first few DCT coefficients are used in calculating the relation. 
[0017] Furthermore, in accordance with a preferred embodiment of the present 
invention, wherein p=l and q=l. 

[0018] Furthermore, in accordance with a preferred embodiment of the present 
invention, wherein, for color imaging, the steps of claim 1 are carried our separately for 
each color plane. 
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[0019] Furthermore, in accordance with a preferred embodiment of the present 
invention, the method further comprises: 

[0020] detecting pixels in said at least two images indicative of motion by comparing 
corresponding image data from said at least two images; 

[0021] evaluating image data value for pixels identified as indicative of motion using 
image data from one of said at least two images and using the image data value in 
constructing the combined image. 

[0022] Furthermore, in accordance with a preferred embodiment of the present 
invention, the step of detecting pixels indicative of motion comprises looking for pixels 
for which the ratio I Long ll Long is beyond a predetermined threshold, t ong is image 
data from one of said at least two images which was acquired with longest exposure 

I Short - Ratio where I short Ratio <255 , 

, where Ratio is a 

255 else 
measure that defines the relation between the images of different exposure. 
[0023] Furthermore, in accordance with a preferred embodiment of the present 
invention, the step of constructing a combined image includes using for pixels identified 
as indicative of motion only image data from one of said at least two images. 
[0024] Furthermore, in accordance with a preferred embodiment of the present 
invention, the image data from one of said at least two images is reconstructed to 
simulate corresponding pixels in the other of said at least two images. 
[0025] Furthermore, in accordance with a preferred embodiment of the present 
invention, the method also includes using image data from one of said at least two 
images which was acquired with longest exposure time incorporated in two illumination 
masks. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0026] In order to better understand the present invention, and appreciate its 

practical applications, the following Figures are provided and referenced hereafter. It 
should be noted that the Figures are given as examples only and in no way limit the 
scope of the invention. Like components are denoted by like reference numerals. 



time, and I Long =• 
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[0027] Fig. 1 illustrates the dynamic ranges associated with "short" and "long" 
exposures. 

[0028] Fig. 2 illustrates a proposed function for combined "short" and "long" exposures, 
in accordance with a preferred embodiment of the present invention. 

[0029] Fig. 3 illustrates a preferred embodiment of the process of enhancing dynamic 
range imaging in accordance with the present invention. 

[0030] Fig. 4 illustrates another preferred embodiment of the process of enhancing 
dynamic range imaging in accordance with the present invention. 
[0031] Fig. 5a is an example of a "long" exposure image. 
[0032] Fig. 5b is an example of a "short" exposure image. 

[0033] Fig. 5c is a threshold image after a threshold filter was applied on the "long" 
exposure image of Fig. 5a. 

[0034] Fig. 5d is an illumination mask image produced by a low-pass filter. 
[0035] Fig. 5e is an enhanced wide dynamic range image made of the "long" and 
"short" exposure images using the illumination mask, in accordance with a preferred 
embodiment of the present invention. 

[0036] Fig. 6a is an enhanced wide dynamic range image produced in a known high 
dynamic range enhancement. 

[0037] Fig. 6b is an enhanced wide dynamic range image produced in accordance with 
a preferred embodiment of the present invention. 

[0038] Fig. 7 illustrates a preferred embodiment of the wide dynamic range 
enhancement method in accordance with a preferred embodiment of the present 
invention. 

[0039] Fig. 8 illustrates the companding process in accordance with a preferred 
embodiment of the present invention. 

[0040] Fig. 9 illustrates the structure of JPEG DCT transform matrix. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

[0041] Ideally, 10-14 bits per pixel would represent the wide dynamic range input 
signal. However, common photographic sensor devices do not exhibit such wide 
dynamic range signals. 

[0042] An aspect of the present invention is the use of multiple exposures in the 
acquisition process of an image. The same scene is imaged more than once (in most 
cases, two exposures are sufficient). One exposure is made at a low level of sensitivity 
(e.g. with an electronic shutter set at a short exposure time). That "short" exposure 
contains highlight details, but most dark image areas are lost in the noise. A second (or 
further) exposure of the same image is taken at a relatively high level of sensitivity (e.g. 
at a long exposure time). The "long" exposure contains details of the darker parts of the 
image, but the brighter area may come out saturated, without any details. 

[0043] Another aspect of the present invention, subsequent to the acquisition, is the 
combination of the images in a predetermined manner so as to produce a single, wide 
dynamic range, image. The predetermined manner consists of identifying in the 
acquired images areas of high or low illumination and mapping them and treating them 
separately - using image data from the short exposure image in areas of high 
illumination, and using image data from the long exposure image in areas of low 
illumination. Furthermore, in interim border zones between the areas (or in fact where it 
is desired so), a combination of weighted values from the image data of both short and 
long exposure images is used. 

[0044] The wide range image may preferably be further processed, for example by 
using dynamic range compression algorithm, to reduce the dynamic range down to a 
useful level - normally 8 bits. 

[0045] The JPEG format is one of the main image compression tools used worldwide. It 
is suggested that the two images of "long" and "short" exposures be acquired by a 
camera and be stored in the JPEG format. In following we will describe a method for a 
construction of the wide dynamic range image from these images directly in the JPEG 
domain. Such an approach allows one to save a significant amount of computation 
efforts and reduce large portion of the memory because it doesn't require an explicit 



opening of the compressed images and is applied on the pixel (or more precisely JPEG 
block) basis. Note, however, that the present invention is not limited to JPEG formats 
only and is in fact independent of the image compression format used (if at all). 

[0046] Suppose that a given scene with wide dynamic range of illumination is 
represented by the two images: f ong and f hor \ where f ong is an image of the scene with 
a "long" exposure and f hort is the image of the scene with a "short" exposure. The long 
exposure provides image details in the dark areas, while the short exposure provides the 
details in bright areas. These images are related by the following: 



[0047] I Long = 



\I Short • Ratio where I short • Ratio < 255 

I 255 else 



[0048] This relation means that except for noise and quantization issues "long" and 

"short" images are linearly related. Hence, combining t° ng and f hort images with the 

following composing function does the construction of the wide dynamic range image. 

[0049] The construction is performed on a pixel-by-pixel basis by: 

[0050] I WDR = a(I Long ) - I Long + (1 - a(I Long )) • I Short • Ratio (1) 

[0051] In the JPEG domain (or DCT transform) the images are given by: 

A ,r -i x&n+Vp x(2n + l}q. 0 < 

^.„ cos ( — )cos( — '* ), 

q 



t dct «« r ,x(2m + l)p. ,x(2n + \)q^ 0<p<M-l 

I DO =a D a a ) > 7 mw cos(— — )cos(— —\ 

p 'htl m > n 2M 2N h 0<q<N-l 

[0052] 



a - < 

9 



!/— q = 0 



[0053] where M and N are the row and column size of DCT block (8x8 pixels), 
respectively and a p and a q are respective transform normalization coefficients. 
[0054] It is evident that in terms of its cosine coefficients the DCT transform is a linear 
function. Therefore taking into account a linear relation between the f ong and f hort 
images it is possible to construct a wide dynamic range image directly in the DCT 
transform domain using relation (1) by replacing a multiplication by convolution 
operator. 

[0055] =a(I% 8 )*I^ +0-«</£ , ))*/,y~ Ratio (2) 
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[0056] where I L ™ g is the DC coefficient of the DCT transform of the "long" image and 
"*" designates convolution operation. Taking into an account a very smooth nature of 
the a(Ioc) function it is possible to use only a small number of DCT coefficients. For 
some images with high frequency content it might be necessary to use a DC component 
for the a, with DCT coefficients p=\,q = l. 

[0057] Reference is now made to the figures in order to demonstrate the approach 
described above. To display the resultant wide dynamic range image, simple gamma 
function has been applied as post-processing to the obtained resulting image. 

[0058] The initial wide dynamic range image is obtained by combining "long" and 
"short" images by the use of so called "illumination mask". The illumination mask is 
calculated based on strongly low-pass filtered version of the thresholded "long" image. 
Applying a threshold to the image removes delicate details and therefore the resultant 
image represents a kind of illumination, which was present during the image 
acquisition. Saturated portions of the image representing bright lighting indicate that 
"short" exposure should be used in these regions. A subsequent low-pass filter smoothes 
the illumination mask enabling soft transition between the data present in "long" and 
"short" images. In order to efficiently implement the above procedure a subsampled 
version of the low-pass filter is used. First, the filter is applied to the decimated 
(subsampled) version of the "long" signal. Eight times decimation is used for the better 
fit to the JPEG signal organization. /™ g signal is decimated to obtain I^° e " c 8 8 . (The factor 

of 8 is chosen to be consistent with the blocks definition of the DCT transform used in 
JPEG compression). Then a four-directional IIR (Infinite Impulse Response) filter of the 
first order is applied: 

[0059] I LP = I LP + a ■ (j£* 8 - I LP ) 

[0060] where a is an IIR coefficient, chosen to be 0.05 and I L D ° e n c \ is either a "long" 
signal itself or its clipped version as explained previously. 

[0061] Note that although in the explanation hereinabove the "long" image data is 
treated first with the illumination mask and the "short" image data is used in the final 
combining stage, it is possible to apply the illumination mask on the "short" image data 
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and use the "long" image data in the final combining step. In other words: the order in 
which the "long" and "sort" images are treated bears no significance. 

[0062] Since the illumination mask bears strong low-pass filtered signal characteristics 
it is possible simply extend the illumination mask to the full grid by simple bilinear 
interpolation, which might be performed in very efficient and fast way. The resultant 
illumination mask will be designated by co. Thus the resultant wide dynamic range 
signal will be composed by: 

[0063] I m * = I Long + co • {I short - I Long ) (1) 

[0064] Noise and color or JPEG artifacts are significantly reduced in the new approach 
of the present invention. 

[0065] The described approach to the construction of the wide range image might be 
implemented directly in the JPEG domain. Using it in the JPEG domain eliminates the 
need for additional signal decompression and recompression and therefore provides 
significant computational savings. 

[0066] We examine equation (1) with respect to the DCT image block of 8x8 pixels. 
For such a block, assuming that the "illumination mask" is constant per block, equation 
(1) might be implemented directly in the JPEG domain (in the same way for each DCT 
coefficient). However, this assumption is not correct and will lead to the strong 
characteristic JPEG blockiness effect. On the other hand using the fact that the co is not 
constant, but has a linear plane form, since it was obtained from the bilinear 
interpolation of the low-pass filter on the sparse grid, enables one to reproduce equation 
(1) in the JPEG domain with some approximation. The equation (1) incorporated with 
the DCT transformation is as follows: 

[0067] J DCTwDR = j DCTl °«s + yy DCT * ^j DCT sh or t _j DCT ^g^ ^2) 

[0068] where * designates convolution. Equation (2) was obtained using the fact that 
multiplication operator becomes a convolution under the Fourier transform. The W° CT 
term in equation (2) represents a DCT transform of the 8x8 plane, which has the most 
significant coefficients only at first column and row (as seen in the following example). 

60.9553 -1.0903 -0.0612 -0.1090 0.0000 -0.0362 -0.0043 -0.0063 

-0.5233 -0.0031 0.0017 -0.0005 -0.0000 -0.0000 0.0001 -0.0001 

-0.0422 -0.0010 0.0001 -0.0001 0.0000 -0.0000 0.0000 -0.0000 

-0.0809 -0.0010 0.0003 -0.0001 0.0000 -0.0000 0.0000 -0.0000 
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-0.0202 -0.0005 0.0001 -0.0001 -0.0000 -0.0000 

-0.0255 -0.0003 0.0001 -0.0000 -0.0000 -0.0000 

-0.0044 -0.0001 0.0000 -0.0000 0.0000 -0.0000 

-0.0049 -0.0000 0.0000 -0.0000 -0.0000 -0.0000 



0.0000 -0.0000 

0.0000 -0.0000 

0.0000 -0.0000 

0.0000 -0.0000 



[0069] Moreover, we leave only a DC coefficient and two first AC coefficients from the 
above matrix for the convolution in equation (2) (see Fig. 9). Additionally, only four 
first coefficients of the DCT block are preferably used in the convolution with DC and 
AC coefficients. For the rest of values in the DCT block we use only a DC term. Such 
approximations tremendously reduce the number of operations required for using 
equation (2), while not causing any noticeable effect on the image quality. The number 
of operations is therefore: 

[0070] N c ™ = 3*4 + 60 = 72 

[0071] for convolution and together with the rest of computation is: 



[0072] N™* x * 



= ArJ; v +2*64 = 200 



[0073] In total this gives us 200 operations per 8x8 pixel block or about 3 arithmetic 
operations per pixel, which does not have zero value. Since, usually a significant portion 
of quantized DCT coefficients has zero values the total number of operations is even 
much lower. Moreover, because of the internal structure of the JPEG compression, zero 
coefficients are readily discarded from the bit-stream, therefore eliminating the need for 
performing the "If command to detect zero values. 

[0074] The following demonstrates the difference between full implementation of (2) 
and its approximated version as described above. 
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[0075] The upper block is a resultant DCT block for the full implementation of the 
equation (2). 

[0076] The lower block is a resultant DCT block for the approximation of equation (2). 
[0077] Color images are reconstructed in the similar way. The composing of a wide 
dynamic range image according to equation (2) is performed on each color plane 
separately, using same co function for all colors. The input to co is a DC component of 
the Y channel in the YCrCb case or Green channel in the RGB case. 
[0078] Since a wide dynamic range image in accordance with the present invention is 
composed from two sequential exposures ("long" and "short") it is possible that there 
will be either local intra-scene or global inter-scene motion during the image 
acquisition. If not taken care, this motion can cause strong artifacts in the final image, 
degrading its quality. 

[0079] To resolve the motion problem the following procedure is proposed: 
[0080] Based on the estimated exposure Ratio (Ratio is defined by the mean of the 
relation of "long" image pixels to respective "short", which are not saturated and not in 
cut-off) a new "long" image is evaluated from the "short" for every pixel as: 



[0082] Then the acquired I Long and evaluated I Long images are compared per pixel 
basis and if I Long II Long ratio is above or below certain thresholds the corresponding 
pixel is declared as motion affected pixel and is inserted into a motion mask index 
image. Depending on the image quality the thresholds might be adapted and varied for 
different intensity levels. 

[0083] In order to make motion estimation more robust it is suggested to perform 
motion detection either on the estimated luminance channel or on the maximal 
difference image from R, G, B components. The luminance is obtained as: 
[0084] Y = (R + 2G + B)/4 

[0085] while the maximal difference image is obtained as: 




• Ratio 



where I 



else 



Short 



• Ratio < 255 



[0086] max (\I L R ong -i? ng , lj? ttg -i£ ng , I L B ong ) 

R,G,B I 
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[0087] If there is too much motion present in the image (i.e., motion count is too high), 
then the global motion is detected and the resultant image is produced either from one 
of the exposures - "long" or "short" (whichever is closer to the normal exposure) or 
from the normal image if available. (By "normal" is meant a regular image acquired 
under an average exposure conditions, which leads to underexposed and saturated 
portions of the scene in the image). 

[0088] Next the motion mask is subjected to an order filter (a kind of a median filter), 
which effectively removes small, unconnected pixels leaving only significant parts of 
the mask, which represent motion. Specifically, the filter operates on 5x5 patch and uses 
17 th element from a sorted array of the motion mask. Since the motion mask has a 
binary representation a fast implementation of the filter is possible by using look-up 
tables. In order to speed processing further the decimation by factor of 2 is used during 
the whole process of motion detection. 

[0089] The above filtered motion mask is then processed with a low-pass filter in order 
to enable smooth transition from I Long to i Long . The filter is a simple low-pass FIR 
comprising of 7x7 mask with all ones. 

[0090] Finally the value of I Long is replaced with the evaluated value of I Long through 
the use of the motion mask: 

[0091] If£* = I Long + MotionMask • (I Long - I Long ) 

[0092] It should be noted that the above procedure for motion detection and 
compensation might be directly applied in the JPEG domain. As described previously 
only four most significant coefficients of the DCT transform might be used for the 
signal reconstruction and subsequent motion detection. The evaluation of I long might 
be performed directly on the DCT coefficients according to: 

[0093] I Long = I Short Ratio 

[0094] In cases where the "short" image does not bear sufficient information (due to 
cut-off condition) to be used for constructing "long" image, a respecting "long" image is 
used even if in the area of motion it is being saturated. This is done by incorporation of 
the motion area indication in to the illumination mask prior to the low-pass filter 
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application. In such a way smearing of the motion region boarders leads to "soft" 
integration of the "long" image information in the resultant image. 

[0095] Eliminating the need for the initial construction of the wide dynamic range 
image with a diapason of greater than 8 bit, enables application of the method of the 
present invention in cases where the camera (or any other imaging device) is not exactly 
calibrated, has rather higher noise levels and does not require linearity of the sensor. 
The suggested method of the present invention also does not require the estimation of 
the exposures ratio, which is a necessary step in previous methods. The noise and color 
artifacts characteristic to the construction of the wide dynamic range image are greatly 
reduced as well as the blockiness effect of the JPEG compression. 
[0096] Moreover, the suggested method of the present invention in a preferred 
embodiment is capable of a very efficient implementation directly in the JPEG domain. 
The adaptation is possible due to the utilization of linear properties of the composition 
method (through the use of the illumination mask) used for the creation of wide 
dynamic range image. Such an approach allows one to obtain a significant savings in 
computation time and memory resources required for the acquisition of the wide 
dynamic range image. 

[0097] A combination of the wide dynamic range image creation with an efficient 
implementation in the JPEG domain provides a feasible solution to the imaging on 
various portable devices with limited resources. 

[0098] The method proposed in this invention is directly applicable to the MPEG 
compressed movies. 

[0099] Hereinafter we refer to implementation of the present invention directly in the 
JPEG domain. 

[00100] Representative DCT blocks of the "long", "short" and reconstructed wide 
dynamic range images are shown next: 
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-23 -3 3 -4 2 -3 -3 -3 -13 6 1 3 -3 -5 0 -1 

13 2 0 -8 -7 3 4 2 2 -2 5 -2 -1 3 0 3 

a) DCT block of the "long" image b) DCT block of the "short" image 



2600 60 24 -48 36 76 4 4 

80 -56 16 8 12 -36 -8 -24 

-76 -32 -32 36 12 12 8 12 

44 -28 20 28 0 -16 -16 0 

-32 24 28 32 -36 -56 12 -8 

40 8 -16 -8 12 56 -8 4 

-52 24 4 12 -12 -20 0 4 

8 -8 20 -8 -4 12 0 12 



c) Composed DCT block of the wide dynamic range image 

[00101] Color images are reconstructed in the similar way. The composing of the 
wide dynamic range image according to equation (2) is performed on each plane 
separately, using same a function for all colors. The input to a is a DC component of 
the Y channel in the YCrCb case or Green channel in the RGB case. 
[00102] Following the construction of the wide dynamic range image (for 
example of 12 bit) from "long" and "short" images it is necessary to convert it to a 
normal dynamic range image (usually of 8 bits) which can be displayed on a regular 
monitor. Such conversion is usually performed by the operation called "companding", 
which is basically applying a gamma function to the wide dynamic range image. 
[00103] However, companding tends to wash out image details, especially in the 
bright areas since in these regions gamma function has very flat response. In order to 
overcome this an operation similar to "unsharp masking" is made on the image prior to 
companding. This operation strips the details from the wide dynamic range image and 
then after the gamma function is applied to the low-pass version of the wide dynamic 
range image, full scale details are added back and the resultant image has a contrast 
appearance. The whole procedure is demonstrated in figure 7. 

[00104] / Re * = T + a{I)\l WDR - /) (3) 
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[00105] We described hereinbefore the procedure for converting the wide 
dynamic range signal would be performed in the image domain. Here it is however 
implemented directly in the JPEG domain on the DCT transformed signal. 

[00106] To implement the above procedure in the JPEG domain, the terms in 
equation (3) are regrouped differently. 

[00107] 7 R " =a{I).I WDR + a{I).F -1 = a(I)-I WDR + /(/) (4 ) 

[001 08] where/fi) is a composite function of/- low-pass image. 

[00109] Since the resultant f image in (4) is a linear combination of the o?f m 

and f(I) it is possible to consider (4) in the DCT domain, i.e. 

[00110] I"*- = a(I)-I DCT >™ + f DCr (I) (5) 

[00111] Examining equation (5) it seen that the resultant image in the DCT 
domain is a combination of the DCT transform of the f DR and the DCT transform of a 
composite function of the low-pass image. The low-pass image in the DCT domain is 
represented by its DC coefficients (in 8x8 pixel blocks); therefore it is reasonable to 
assume that we can approximate the low-pass of the complete image by making a 
bilinear (or bi-cubic version) interpolation of its DC components. Then by making a 
forward DCT transform of the interpolated low-pass image Xhtf CT (l) term in equation 
(5) is produced. For actual computation it is not necessary to perform the whole 
interpolation and DCT decomposition because their combination is a well-defined 
operation that might be precomputed and then applied as a look-up table to four 
surrounding points of the DC component in the bilinear interpolation. Alternatively, the 
full low-pass image might be constructed by using not only the DC terms but the first 
order components of the DCT transform thus making even better approximation to the 
second term in equation (5). 

[00112] The resultant image f es is produced directly in the JPEG domain thus 
eliminating the need for a decompression of the "long" and "short" images and 
subsequent re-compression of the result, ensuing significant computation time and 
memory savings. 

[00113] Depending on the extent of the dynamic range of the scene it might be 
necessary to enhance differently the details in the resultant f es image. When the 
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dynamic range is small (Ratio between "long" and "short" images close to 1) the 
enhancement of the details in bright and dark areas should be similar, because the 
gamma function in the companding function is almost linear. However for the wide 
dynamic range images a compounding the gamma function is strong, thus making 
details enhancement dependant on the local image brightness. This correction of details 
enhancement might be easily incorporated into our scheme of the wide dynamic range 
compression. Since for each 8x8 pixels block the DCT transform provides a ready-made 
decomposition to low-pass and high-pass frequencies, it is possible to enhance 
progressively higher frequencies in increasing order by simply amplifying the DCT 
coefficients. 

[00114] This may be done either directly by multiplying the coefficients in a 
zigzag order of the I DCT ™ image in every block, or simply to adjust the coefficients 
quantization table for the subsequent JPEG decompression. It is even possible to vary 
the extent of the enhancement according to the DC value of the current block, or the 
value of the interpolated low-pass signal. 

[00115] To sum up the process: in the first stage a wide dynamic range image is 
constructed based on the "long" and "short" exposure images using DCT transform 
coefficients. The construction is possible due to the utilization of linear properties of the 
composition method used for the creation of wide dynamic range image. Such an 
approach allows one to obtain a significant savings in computation time and memory 
resources required for the acquisition of wide dynamic range image. 
[00116] In the second stage a compression of wide dynamic range image into the 
normal image, displayable on the monitor, is performed using a direct JPEG domain 
based method. The proposed scheme enables to preserve image details usually lost 
during the standard methods utilizing companding functions and even to perform an 
enhancement of the image details. 

[00117] A combination of wide dynamic range image creation with its 
subsequent conversion into a clear and detailed normal image provides a feasible 
solution to the imaging on various portable devices with limited resources. 

[00118] The method proposed in the present invention is directly applicable to 
the MPEG compressed movies. 
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[00119] It should be clear that the description of the embodiments and attached 
Figures set forth in this specification serves only for a better understanding of the 
invention, without limiting its scope as covered by the following Claims and their 
equivalents. 

[00120] It should also be clear that a person skilled in the art, after reading the 
present specification could make adjustments or amendments to the attached Figures 
and above described embodiments that would still be covered by the following Claims 
and their equivalents. 



